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ABSTRACT 

We discuss the very different roles of the valence-quark and the sea- 
quark residual masses (m" es and m^) in dynamical domain- wall fermions 
simulations. Focusing on matrix elements of the effective weak hamilto- 
nian containing a power divergence, we find that m" es can be a source of a 
much bigger systematic error. To keep all systematic errors due to resid- 
ual masses at the 1% level, we estimate that one needs am r s es < 1CT 3 and 
am^ es < 10~ 5 , at a lattice spacing a ~ 0.1 fm. The practical implications 
are that (1) optimal use of computer resources calls for a mixed scheme 
with different domain-wall fermion actions for the valence and sea quarks; 
(2) better domain-wall fermion actions are needed for both the sea and 
the valence sectors. 



1. Introduction 



Lattice QCD is entering a stage in which it is becoming more standard to include 
dynamical fermions (i.e. sea quarks) in numerical simulations, in order to move away 
from the quenched approximation. Needless to say, this step is crucial if lattice QCD 
is to become really predictive, with full control of all systematic errors. 

What has not changed is that simulations with dynamical fermions are very 
expensive: updating the fermion determinant takes the bulk of the computer time. 
It is therefore of key importance to choose the parameters of any simulation such 
that the desired precision of physical observables is obtained. However, in order 
to use resources wisely, it is equally important to balance the effort with respect 
to the various systematic errors involved. It is questionable to spend an inordinate 
amount of computational effort to control one type of error to machine precision, 
while not controlling some other error to better than, for instance, one percent. This 
is especially the case if the difference amounts to making the simulation feasible or 
not. 

In the case of domain- wall fermions (DWF) PIE], an important systematic error 
comes from the explicit breaking of chiral symmetry due to the fact that the size of 
the fifth dimension, L5, is kept finite. Chiral symmetry is better at larger L5, but 
the computational cost also grows with L5, making the choice of L5 an optimization 
problem. Since the cost of simulating the fermion determinant is so large, this opti- 
mization problem is particularly acute if one wishes to simulate unquenched lattice 
QCD using DWF for both the valence- and sea-quark sectors. 

In this paper we propose that, given a certain DWF discretization of the Dirac 
operator, the size of the fifth dimension entering in the sea-quark Dirac operator, 
L£, may be chosen much smaller than its valence counterpart, L^. 1 This could be 
of substantial help in reducing the cost of simulations with dynamical DWF, while 
keeping control over systematic errors due to the lattice-artifact chiral symmetry 
breaking coming from a finite L5. 

Since the quality of chiral symmetry is most important in weak hadronic decays, 
we will center our discussion on the well-known case of non-leptonic kaon decays. In 
particular, we have in mind the method, proposed long ago [3], of determining physical 
kaon-decay matrix elements from simpler, unphysical ones, such as K + — > 7r + and 
using chiral symmetry. As is well known, power divergences are encountered 
in this computation, and good chiral symmetry is a key ingredient in controlling it. 

We begin with a list of observations, on which this proposal is based. The rest of 
the paper elaborates on these observations. 

Observations 

1. Violations of chiral symmetry in Ward-Takahashi (WT) identities and in phys- 
ical matrix elements arise from the valence sector only, essentially by definition. 
They vanish when approaching the valence- DWF chiral limit L\ — > 00, in which 

1 Superscripts v and s refer to quantities in the valence- and sea-quark sectors, respectively. 
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the "residual quark mass" (i.e. the effective quark mass due to a finite L 5 when 
the explicit quark mass has been set equal to zero) m" es goes to zero, regardless 
of the value of m* es . 

2. Consider any operator Q belonging to the effective weak hamiltonian, trans- 
forming in (8l, 1r) of SU(3)l x SU(3)r. If m% es = then, for any m* es , 
its power-divergent piece arises from a single lower-dimension operator 02. 
One subtraction, Q su b = Q — TO2, defined by the renormalization condition 
(0| <2sub = 0, removes the power divergences from all of its matrix elements 
simultaneously. 

3. We may constrain m v res and m* es by the requirement that m res -induced deviations 
of physical matrix elements from the chiral limit will, for instance, be at the 
1% level. Since the systematic effects due to m" es 7^ are magnified by inverse 
powers of the lattice spacing, we end up with a rather tight constraint on m" es . 

4. Systematic effects due to m* cs 7^ alone are not magnified by inverse powers 
of the lattice spacing when following the subtraction procedure defined above. 
Therefore, chiral perturbation theory gives rise to a milder bound on m* es . A 
comparable bound arises from considering m* cs -induced chiral-symmetry viola- 
tions in the mixing among dimension-six operators as well as scaling violations. 

Allowing for m* es to be much less constrained than m v res translates into the pos- 
sibility of choosing L s 5 much smaller than L\. This, in turn, may lead to a sub- 
stantial economy of computer time, making dynamical DWF simulations possible on 
currently-planned machines. 

The rest of this paper is organized as follows. In Sect. 2, we explain the main 
observations (1 and 2) in more detail, paying particular attention to power-divergent 
subtractions. In Sect. 3 we analyze the situation at next-to-leading order (NLO) 
in chiral perturbation theory (ChPT), with emphasis on the role of m^. As this 
knowledge will be crucial for the bounds we intend to establish, we give a rather 
detailed account of the way power divergences relate to low-energy constants (LECs) 
in ChPT to NLO, including a discussion of short- versus long-distance effects. Sect. 4 
then gives estimates of the numerical bounds which m^l need to satisfy in order to 
determine the leading-order (LO) LECs relevant for K — > -kit decay to about 1% 
accuracy, at an inverse lattice spacing of about 2 GeV. In Sect. 5 we discuss the 
implications of our results. The systematics of residual chiral symmetry violating 
effects is discussed in Appendix A. 

2. Power divergences 

2.1 Setup and (chiral) symmetry considerations (Observation 1) 

It is easiest to illustrate the very different roles of the sea and valence chiral sym- 
metries in a simplified, extreme case where the valence quarks have an exact chiral 
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symmetry (for m v q = 0), while the sea quarks are Wilson fermions, with basically 
no chiral symmetry at finite lattice spacing. This "mixed" framework was studied 
recently in a chiral- lagrangian framework in ref. 0]. 

We will assume the number of valence quarks to be N v , and the number of sea 
quarks to be N s . With a mixed action, there is no (exact) symmetry that mixes the 
valence and sea quarks. With Wilson sea quarks and Ginsparg- Wilson valence (and 
ghost) quarks, the lattice flavor symmetries are 

G sea = U(N S ) V 

G val = [SU(N V \N V ) L ® SU(N v \N v ) R ](s)U(l) v . (2.1) 

The (s) symbol denotes a semi-direct product. 

Let us now consider an axial WT identity. It is obtained by applying a (local) ax- 
ial transformation to the valence quarks only (and not to either sea or ghost quarks). 
Since the (global) non-singlet axial symmetry of the valence quarks is exact, there 
are no lattice-artifact chiral symmetry violating terms in the resulting WT identity. 
This is true in particular for the PCAC relation, implying that the (valence-quark) 
pion mass must vanish exactly for m v = 0. Whether or not the sea quarks have an 
axial symmetry plays no role here! All that is necessary is that the gauge fields do 
not transform under the flavor symmetries, and therefore there is no way to "com- 
municate" (axial) flavor transformations applied to valence quarks to the sea quarks. 

2.2 Application to K — > n penguins (Observation 2) 

Similar considerations apply to any operator mixing which occurs in the calculation 
of physical matrix elements. The basic reason is that, by definition, the operators 
that interpolate the external states are made of valence quarks only. This can be best 
illustrated through an example. 

In the calculation of matrix elements needed for K — > 7T7r, the most dangerous 
mixings are the power-divergent subtractions. This allows the four-fermion operators 
occurring in those matrix elements to mix with the fermion bilinear [3] 

2 = s 7/ X^-^)(l- 75 )d 

= (m s + m d )sd + (m d - m s )sj 5 d , (2.2) 

where = denotes on-shell equality. 2 In full continuum QCD the operators that can 
mix with a fermion bilinear are the (8^, 1#) ones. In a hybrid scheme, the four-fermion 
operator must contain valence s and d. The other two fermions may be valence, sea, 
or ghost quarks. When the valence sector has the exact chiral symmetry of Eq. (j2.1j) 
with N v — 3, the effective weak hamiltonian contains four-fermion operators which 
transform as (8^, 1^) under SU(3)l x SU(3)r C G va i. They can thus mix with an sd 
bilinear with the same quantum numbers (including CPS symmetry 3], under which 

2 In this paper, we only consider matrix elements with on-shell external states, and we may thus 
use the equations of motion. However, we will allow for matrix elements which do not conserve 
momentum. 
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the weak hamiltonian is even). 3 In order to be able to use the G va \ symmetry in 
constructing such bilinears, the quark mass matrix M is promoted to a spurion field, 
transforming as M — > LMR^, with L, R e SU(N v \N v )L t R- The only fermion bilinear 
with dimension smaller than six satisfying these requirements is 02. This operator 
has mass-dimension four, and hence it mixes with the octet weak hamiltonian through 
a coefficient proportional to 1/a 2 . It also follows that there is no mixing of order 1/a, 
because a dimension-five operator with the appropriate quantum numbers does not 
exist. 



2.3 Renormalization condition for the power divergence 

In field theory one renormalizes operators, not individual matrix elements. 4 Given a 
four- fermion operator Q which mixes with O2, we thus need to define a subtracted 
operator 

Qsub = Q-^0 2 . (2.3) 

Here T is a dimensionful function of the parameters of the theory which diverges like 
1/a 2 . Among these parameters are the explicit quark masses contained in the lattice 
action. If we assign to the mass parameters their spurion transformation properties, 
JF must be invariant under all symmetries (for the hybrid theory, this includes in 
particular the symmetries of Eq. (j2.1j) and CPS). 

As already mentioned, no lower-dimension operator other than O2 has the quan- 
tum numbers of Q. (This is not true if m" es 7^ 0, see below.) Hence, a single 
renormalization condition is sufficient to determine the subtraction. Moreover, the 
single subtraction (|2.Hjl must remove the entire power-divergent part of Q (see be- 
low for logarithmic divergences). Here we choose the homogeneous renormalization 
condition 5 



= (0 



a 



sub 



which yields 



Q K°) - T (m d - m s 



Q\K°) 



s-f 5 d 



(2.4) 



(2.5) 



(m d -m s ) (0\s l5 d\K°) 

Equation (J2.4|) is a consistent renormalization condition because, for 7^ m s , the 
power-divergent part of Q contributes to the matrix element at LO in ChPT. Indeed, 
the power divergence corresponds to (the matrix element of) the operator s^d. This 
operator is a total divergence on shell, which contributes to K° — ■> because momen- 
tum is not conserved in that matrix element Since momentum is conserved for 
the physical K — > tttt matrix elements, one does have that (7T7r| 2 \K) = 0. 

Before we move on, we need to discuss the freedom in choosing the renormal- 
ization condition. Generically, a physical renormalization condition may be written 



3 In the case of the strong penguins Qs,6 only the "partially-quenched singlet (PQS)" operators 
of ref. can mix with sd. In this paper, we consider only PQS operators. 
For a similar discussion in this context, see Ref. (Hj- 

5 This renormalization condition was also discussed in the context of the direct determination of 
K — > 7T7T matrix elements in ref. [7|. 
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as (A\ Q su b \B) = C, where \A) and \B) are some states, and the (not necessarily 
zero) constant C is finite in physical units. How does changing the renormalization 
condition affect the subtracted operator and its matrix elements? Let us expand the 
subtraction coefficient in Eq. (|2.3|) as a power series in a: 

F{a) = ^ + F + O(a). (2.6) 

Or 

If we choose a different renormalization condition, we will end up with a new sub- 
traction coefficient 

r(a) = ^ + ^ + 0(a). (2.7) 



a 



How are J- (a) and JF'(a) related? Let Q' suh = Q — T'Gi- Then 

Q' suh K°) = (OQ K )~m + ^ + 0(a)) (m d -m s )(0s l5 d K°) (2.8) 



'T 2 - r 2 

a? 



+ Tq - T'q + 0(a) (m d - m s )(0 s-f 5 d 



The last equality is true because (0| <2 su b \K°) = by construction. We see that 
(0| Q' snh \ K°) is finite if and only if Ti = Now, since Q can mix with only one 
lower-dimension operator, it follows that any consistent renormalization condition 
must give rise to the same value for JF 2 . In other words, the power-divergent part of 
T is independent of the renormalization condition, but the finite part is not. (That 
is, in general JF ^ T'§. Of course, the same is true for the 0(d) part.) In the 
case at hand, the subtraction defined by Eq. ()2.3|) and the renormalization condi- 
tion (0| Q su b \K°) = ensures the absence of power divergences in (ir\ Q suh \K) and 
(7T7r| Q sub \K) simultaneously. Denoting the coefficient of 1/a 2 in the divergent part 
of a matrix element by u div" this statement can be summarized in the suggestive 
form 

div(A\Q\B) = div (0|Qg) 
(A\0 2 \B) (0| O a |i^°> 2 ' 1 • ] 

for any two states (A\ and \B). 

The matrix elements of Q su b will in general still contain logarithmic divergences. 
Fully renormalized operators may be defined on the lattice using e.g. the regulariza- 
tion independent (RI) non-perturbative renormalization (NPR) method [El El- (For 
a different method, see ref. fSj.) Renormalized operators = ZjQ 3 suh are defined 
by the requirement that their matrix element between given (gauge fixed, off shell) 
high-momentum four-quark states has a prescribed value. Typically, one performs a 
second (finite) renormalization in order to relate the lattice-renormalized operators 
to renormalized operators defined in some continuum scheme such as MS. 

The entire renormalization procedure discussed above does not rely on the ef- 
fective chiral lagrangian. The numerical extraction of JF- 7 can be done directly from 
Eq. (|2.4j) . Similarly, the NPR matrix Zj is determined numerically. In the case at 
hand, we obtain renormalized K — > ir matrix elements (ignoring for simplicity the 
renormalization of the external states) via 



H Q r \K) = Z) (tt| Q J suh \K) = Z) (<tt| Q? \K) - T\m d + m s ) (tt| sd \K)) . (2.10) 
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This result can now be matched on to the appropriate ChPT expression. The LECs 
obtained this way are then used to determine the K — > ttti matrix elements. 

While its is clear that the complete renormalization is carried out without using 
ChPT, the ChPT calculation must conform to the actual renormalization procedure 
that we envisage performing numerically. We will return to this point below. 



3. Chiral perturbation theory 

In this section we work out a ChPT example. The central point is that, as long 
as m v tes = 0, the above procedure completely removes the power-divergences even if 
both the un-subtracted Q and O2 matrix elements depend on m s res . 

3.1 Matrix elements of Q and 2 for m rcs = 

We begin with the representation of a general weak (8^, 1#) operator to NLO in ChPT 
in the continuum, working in the limit in which = 0. In this case, the theory is 
partially quenched |10| . with an enlarged symmetry group 6 

[SU(N V + N S \N V ) L x SU(N V + N S \N V ) R ] x U(l) v . (3.1) 

Any octet operator Q mediating AS* = 1 transitions can be written as (in euclidean 
space) 7 

Q = 0,2 + Qa + ■ ■ ■ , (3.2) 
Q 2 = -a?str(AL M L M ) + a^str(AX+) , 

in which af and (3f denote LECs at LO and NLO, respectively, and 

A} = M i2 . (3.3) 

In terms of the non-linear field S = exp (22$//), 

Lfj, = i£d M £t, (3.4) 
X± = 2B (EM t ± MS f ) . 

M is the (diagonal) quark mass matrix containing the valence, sea and ghost quark 
masses and, after promotion to a spurion field, transforming just like S under the 
full chiral group Gl x Gr, E — > LluR) , with L,R e Gl,r- The list of NLO operators 
relevant for our purposes is [T3] 

Of = str(AX+X+), (3.5) 

6 For our purposes, it is sufficient to consider this symmetry group, even though the actual sym- 
metry group is slightly different, see Refs. 

7 We will use the notations and conventions of ref. • 
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= str(AX+) str(X + ) , 


ol 


= str(AX_X_), 


o\ 


= str(AX_) str(X_) , 


ol 


= str(A[X + ,X_]), 


O s 10 


= str(A{X + , LpLj) , 


o 8 


= str(AL M X + L M ) , 


o s 


= str(AX + ) str^L^) 


o s 


= str(AL M L M ) str(X + ) 


o 8 


= str(A[X_,L M L M ]). 



There are two total-derivative operators relevant for our discussion |12j . 

Of = id fM str(A[L lx ,X + ]) , 61 = z<9 M str(A{L M ,X_}) . (3.6) 

In partially quenched QCD, Of A is an independent operator, which cannot be written 
in terms of the other operators |14j . 8 However, the operators Of, Of and Of are not 
independent: 

O 8 - O 8 + O 8 = 4 str(AMM t ) = , (3.7) 

where the latter equality holds because M is diagonal. Henceforth, we will set /3| = 0. 

We also need the ChPT representation of the operator O2 to NLO. This operator 
can be written as the AS = 1 part of the variation of the (continuum) QCD lagrangian 
under a chiral rotation which transforms M — > (A + A*)M. Since the QCD lagrangian 
is represented in ChPT by the strong chiral lagrangian, all we need to do is to perform 
this same chiral rotation on the strong chiral lagrangian. Doing this, keeping only 
the A (AS* = +1) part, one obtains jTB] 

2 = ^str(AX + ) (3.8) 

o 

+ (L 8 + l -H 2 )0 8 + 2L Q 8 2 + (L 8 - \h 2 )0 8 + 2L 7 8 A + l -H 2 8 
+h 5 o 8 w + L,0 8 3 - h 5 of 5 + L^Ol + 61) , 

where Lj and H 2 are 0(p 4 ) Gasser-Leutwyler coefficients [T^]; Of + (9| is the chiral 
rotation of 2id^ str(L At X_), and Li is a new strong LEC multiplying this operator. We 
have added this total-derivative operator just as we did in Eq. (|3.2j) . (The operator 
2 is a local operator, not integrated over space-time. So, more precisely, one may 
obtain the operator 2 by doing the chiral rotation on the strong energy-momentum 
tensor, which may contain total-derivative terms.) We may set H 2 = because of 
Eq. (HUD. 

It is now straightforward to calculate J 7 , defined in Eq. (12. 5 j) . to NLO in ChPT, 
using the results for (0| Q \K°) from ref. [12], and Eq. ()3.8|) above. Since at LO only 

8 A point which has been overlooked in ref. |12| . 
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str(AX + ) contributes to K° — > 0, the chiral logarithms proportional to a 8 , drop out 
of J 7 , and we find 



T = — 

P 



a 8 + a\ x (chiral logs) (3.9) 



+ ZTT^ ( ( 4/3 i " 32As "2 + " 16A?al)(m. + m d ) + (4/3 2 8 - 64A 6 a 8 ) E 



(4tt/) 
in which 

A, = 1671-%. (3.10) 

We may now use this result for T to define the subtracted operator Q su b in 
ChPT, and calculate its K + — > 7r + matrix element for degenerate valence quark 
masses m v . We find that the LO terms proportional to a 8 ,, as well as the chiral 
logarithms generated at NLO by the corresponding operators, cancel in the matrix 
elements \ Q \K + ) and T \ 2 \K + ), leaving us with the result 



7T + 



Qsub 

4B, 



8B m- 



af(l + chiral logs) (3.11) 



P 

J^U (2$ + Wfo + fti + 8X 5 (af - a 8 ) - 16A 8 (a 8 + a 8 ) - ft + 8A> 8 ) m v 
+ (/? 8 4 - 16A 6 a 8 + 8A 4 a?) E ; 

i 

where we took the limit m s —>■ ma = m v in expression (J3.9j) for T . 

The "chiral logs" are those coming from the af operator in Eq. ()3.2|) . and an 
explicit expression can be found in ref. J2|- 9 The NLO terms proportional to af 
come from the fact that we expressed the common mass M p h ys of the physical (valence) 
meson, which appears at LO in the af term, in terms of the valence quark mass m v 
and the sea quark masses m|, using 



(M phys ) 2 = 2B m v 1 1 + chiral logs + ° 2L " 



(2A 8 -A 5 V + (2A 6 -A 4 )E 



m 4 



(47T/) 2 

(3.12) 

Finally, we omitted wave-function renormalizations for the states \K + ) and |7r + ). 

Let us discuss what we learn from these results. In the NLO part of Eq. ()3.11)1 . 
m v and J2i mf are multiplied by linear combinations of LECs having the generic form 

E^-^E^, (3.13) 

» 3 

where Cj and o?j are numerical coefficients. We know from the general arguments of 
Sect. 2 that the subtracted matrix element in Eq. (|3.11|) contains no power divergence. 
Since m v and m| are free parameters, it follows that the linear combinations ()3.13|) 
are not power divergent either! This simple observation leads to several important 
conclusions. 



3 See Eq. (4.19) of that paper. 
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Corollary 1. Some of the LECs j3f diverge like I /a 2 , just like cuf. Proof: The linear 
combination J2j djXj occurring in expression (J3.13j) will in general be non-zero. Since 
a| diverges like 1/a 2 , so must J2i c iPi- (This observation is implicit in the work of 
ref. HU.) 

Corollary 2. At NLO, un-subtracted matrix elements contain contributions that 
diverge like (l/a 2 )(Bom/ f 2 ) n for both n = 1 and n = 2. (Here m stands for a 
generic mass parameter.) Proof: For n = 1 this is evident from the LO result. 
For n = 2, observe that whenever Of occurs in the matrix element (tt + \ 2 \K + ) 
with a non-zero coefficient, the corresponding (power divergent!) LEC /3f occurs in 
(/T + | Q \K + ), where it is multiplied by (B m/ f 2 ) 2 - It is quite evident that, as we go to 
higher orders in ChPT, un-subtracted matrix elements will contain divergent pieces 
(l/a 2 )(B m/ f 2 ) n with correspondingly higher values of n. 

The last observation may seem surprising at first, because intuitively one expects 
that short-distance divergences should be insensitive to long-distance effects. We 
believe that this intuitive notion is, in fact, correct, but it has to be stated more 
carefully. This leads us to the following two conjectures. 

Conjecture 1. For any operator Of occurring in the NLO representation of O2, the 
linear combination /5| ub = (3i — Y^j e ij^j contains no power divergence. Here the 
linear combination Ylj e ij^j is equal to the combination of strong LECs that multiplies 
Of in Eq. (HI), 10 with Lj -> A^. 11 

It is easy to understand why the combinations P^ nb should be finite. Seven of 
the NLO operators Of in Eq. ()3.5|) as well as the two total-derivative operators in 
Eq. ()3.6|) occur in the representation of 2 (Eq. ()3.8|) ). Separately, the matrix elements 
(A\ Q\B) and J r (A\0 2 \B) each diverge like 1/a 2 . But taken together they yield 
(^4| Qsub 1-6), which must be finite. If we now consider a sufficiently large collection 
of matrix elements (corresponding to various choices of (A\ and/or \B)), we expect 
that, for each i, (A\ Of \B) will depend in a different way on all the quark masses. 
The obvious way for all the subtracted matrix elements to be finite simultaneously 
is, therefore, that the /9| ub be finite. As an example of this, the combinations /3f A and 
2(/3f - 8X 8 a 8 2 ) + 2((3f - AX 5 a 8 2 ) + (3^ - 0f - 8Aio^) in Eq. (l3~TH) have to be finite. 
We thus believe that it is only a matter of more work to prove that all the /?f ub 's are 
in fact finite. 

Our next conjecture deals with the separation of short- and long-distance effects. 

Conjecture 2. The subtraction coefficient T defined in Eq. f!2.4j) contains no diver- 
gences of the form (l/a 2 )(B Q m/ f 2 ) n for n > 1. In other words, T 2 (Eq. ()2.6|0 corre- 
sponds only to the a\ term in Eq. ()3.9|) and is independent of all mass parameters, 
while the finite To corresponds to the entire NLO part of Eq. (J3.9j) . 

The evidence supporting this conjecture is that the NLO part of J 7 , Eq. ()3.9|) . can 
be expressed as a function of (3^ ub only. Conjecture 2 thus follows from Conjecture 1, 
to NLO. We believe that this demonstrates in the case at hand how the short-distance 

10 By inspection, for each i, dj is non-zero only for one value of j. 

11 This claim is already contained in ref. ^3], see in particular Table 5 therein. 
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divergences are disentangled from the long-distance physics. 

In matrix elements, the short-distance divergences get multiplied by factors that 
originate from the long-distance physics of the matrix element. It should therefore 
come as no surprise that un- subtracted matrix element contain divergences of the form 
(l/a 2 )(B m/f 2 ) n for n > 1. 

Our results for T and (7r + | (9 S ub are different from those found in ref. [T4"j. 12 
The reason is that in ref. ^3] the subtraction condition Eq. (12. 4 j) was only imposed 
to LO in ChPT, taking T = 8a|/ f 2 instead of Eq. ()3.9|) . This amounts to using a 
different subtraction condition, because the NLO combinations f3f — 8\sal, Pf — SXfa^ 
and /3f — 16A6Q!2 are finite (up to logarithms), as we argued above. Thus, to NLO, 
the subtraction condition used in ref. [TJ] is of the form (0| Q' suh \ K°) = C ^ 0, with 
C finite. Comparing to Eq. ()2.8|) . indeed Ti — T'^ but To — T' Q does not vanish, and 
is given by the NLO terms of Eq. ()3.9|) . 

While both subtraction conditions thus remove power divergences from all matrix 
elements, the subtraction condition of Eq. (|2.4j) is easier to implement numerically, 
because no reference to ChPT is made. The condition discussed in ref. ^3] distin- 
guishes between LO and NLO, and therefore its implementation requires fitting to 
the appropriate ChPT expressions. 

Finally, we emphasize again that physical results do not depend on which sub- 
traction condition is chosen, because (7r + 7r~| O2 = 0. As already mentioned, at 
the level of quark operators, this equality follows from the fact that the parity-odd 
part of O2 is a total divergence on-shell; this argument can be carried over to ChPT 
using the chiral WT identities. 

3.2 The case m res ^ 

We now investigate how the situation changes when m res 7^ 0. The chiral symmetry 
breaking, present in lattice QCD with domain-wall fermions due to the fact that the 
limit L5 — > 00 is not taken, can be represented by a local operator transforming in 
the same way as an explicit mass term [2|. We take the effect of this chiral symmetry 
breaking into account by introducing a new spurion field M res , which is then set equal 
to 

M res = diag(m^ es , . . . , m^ es , . . . , r< es , . . .) . (3.14) 

M res is expected to be small if both valence and sea quarks are well outside the Aoki 
phase J7j. The first and last set of entries refer to the valence and ghost sectors, 
while the middle set of entries refers to the sea sector. Both m v res and m s ves are flavor 
independent. (For a more detailed discussion of the systematics of residual chiral 
symmetry violations, see Appendix A.) We may thus define new building blocks for 
operators in the effective theory [To] : 

X™ s = 2S (EM r t es ± M rcs £t) . (3.15) 

The choice of 2B as a proportionality factor defines the normalization of m^. 
12 See Eq. (62) of that paper. 
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The presence of M res leads to new C PS-even operators in the effective theory: 

a®str(AX+) -> # 2 str(AX^ es ), (3.16) 
(3\Ol -> r ?1>1 str(A{X + ,X; es })+r /l , 2 str(AX- s X; es ), 
PlOl - r /2il str(AX + )str(X- s )+r ?2i2 str(AX- s )str(X + ) 

+ r ?2 , 3 str(AX; cs )str(X; es ), 
/3 3 8 C»3 -> 773,1 str(A{X_, X! 68 }) + 773,2 str(AX! cs X! es ) , 

-> ^ 4 ,istr(AX_)str(X! es ) +r/ 4 , 2 str(AX! cs )str(X_) 

+ r7 4i3 str(AX! cs )str(X! es ) , 
/M - *7i str(A{X» L^}), 
PnOli - T/nstrCAL^L^), 
PfsOf, - r /l3 str(AX; cs )str(L At L^), 
/?! 8 4^ 4 - r/ 14 str(AL At L /t )str(X; es ), 
/Ms - r/ 15 str(A[X- L^]). 

In the strong effective lagrangian, there are new LO and NLO operators of the form 
(giving only the ones relevant for the matrix elements of interest) 

str(X+) -> str(X; es ), (3.17) 

A 4 str(L^)str(X + ) -> k 4 str(L M L (U ) str(X; es ) , 

A 5 str(L^L M X+) -> k 5 str(L At L M X^ cs ) , 

A 6 (strX+) 2 -> Kg,! str(X+) str(X; cs ) + k 6i2 (strX; es ) 2 , 

A 7 (strX_) 2 -> k 7i i str(X_) str(X res ) + k 7j2 (strXL es ) 2 , 

A 8 str(X 2 +X 2 ) -> / ?8 , 1 str(X + X; es + X_XI es )+K 8 , 2 str((X; cs ) 2 + (X! es ) 2 ). 

The contributions of these new operators to (9 2 can easily be obtained by applying 
the chiral rotation M — ► (A + AT)M, M res — > (A + A^)M res to these new operators. 

First consider the case that only m* es ^ 0, while m^ es = 0, i.e. the valence quarks 
have the same chiral symmetry as in the continuum theory. Clearly, for fixed, non- 
zero m* es , the symmetry group of Eq. (|3.1|) gets broken to the group of Eq. (12. ip . 
Keeping only tree-level contributions from both LO operators and from the NLO 
operators listed in Eq. ()3.16|) . we find that 

n n vro\ 4iB„(m a - m d ) f 8 AB 
OQK) = ^ a2 + —— r]2lNsm 

(3.19) 

in which m v is again the degenerate valence quark mass in the K + — > tt + decay. 13 One 

13 We see that all that happens is that, in the un-subtracted matrix elements, the LECs a\ and a| 
"get shifted" by an amount proportional to m^ cs . This is a natural consequence of the form of the 
relevant NLO operators. The reader can verify that any "shift" in a\ drops out of the subtracted 
matrix element. 
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can show that 772,1 also diverges like 1/a 2 , using arguments similar to those discussed 
in Sect. 3.1. The 772,1 dependent terms are thus new examples of Corollary 2. 

The subtraction of power divergences based on Eq. (|2.4|) still goes through un- 
changed. Explicitly, we find that 



T 



T 



32B 

+ — — ^tttt(^2,i - 8K 6A al)N s m s Tes , 



7T 



Q 



sub 



K~ 



7T 



mrcs=0 (4vr/) 2 / 2 

Qsub 



32B$m v 



(4vr/) 2 /2 



r/14 A^ s mJ 



(3.20) 
(3.21) 



This result shows that r/14 is finite. We also expect that rj^i = r/ 2i i — 8^6,1^2 i s a new 
finite combination of LECs, conform Conjectures 1 and 2. 

Of course there are corrections to these results: Eq. ()3.12|) . which expresses the 
physical pseudo-scalar mass in terms of chiral-lagrangian parameters at NLO, gets 
modified by the k LECs defined in Eq. (I3.17|) . Also the chiral logarithms get modified: 
they are those of ref. [T2], with the proviso that now M\ s = 2B (m s + m^). The 
corresponding O2 matrix elements needed for the subtraction get new contributions 
from the m^ es dependence of the strong chiral lagrangian to NLO. However, none of 
these corrections changes the conclusion that the subtraction procedure still works. 
The underlying reason is that the operator O2 in Eq. (|2.2j) is still uniquely the only 
lower- dimension operator that Q can mix with. 

The situation changes if we now also let m^ eB be non-zero. In that case, also 
the valence chiral symmetry in Eq. ()2.1|) gets broken, and another bilinear operator 
0' 2 = 2m^ es sd appears which can mix with Q and (9 2 . Let us see how this works 
in ChPT, again omitting mass and wave-function renormalizations, as well as chiral 
logarithms. Instead of Eqs. (J3.18J) and (J3.19J) we find 



Q 



Q 



K° 



m rcs =0 

l6iB$(m s - m d ) 



71 



Q 




{r] 2)1 N s m s res + 2rji tl m v ies ) 
+ 



m rcs =0 



8B Q m u res 8 

- V 2 ) 



P 



(3.22) 



(3.23) 



(^f?P 
32B%(m? + m 



(47T/) 2 / 2 



{2fa + fa)vn? + fa^ 

i J 

ri u N s m s ms + (2r/ 10 + r} U )m l T 



V 

res/ 



32B$m v . 

WJW 1 

(^f) 2 P 



r] 2<1 N s m s res + 4(771,1 + V3,i)m v res j 
(v2,3N s m s Iea + 2(t7i j2 + Vs^m^ + r/ 2 , 2 m i) ■ 



The terms involving af or j3f arise because the physical (tree-level) valence-meson 
mass gets modified to M^ hys = 2B (m v + m^ es ). 
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The problem one faces when m v res 7^ comes from the fact that two power- 
divergent subtractions (both of order 1/a 2 ) are needed now, because of the presence 
of the two lower- dimension operators O2 and 0' 2 . An analysis following the lines of 
Sect. 3.1 shows that the LECs 9 2 , rjij, 772,1, 77 3j j and 7710 all are expected to diverge as 
1/a 2 . Performing only one subtraction with O2 would still leave 1/a 2 divergences in 
the once-subtracted matrix elements. The only way out is to define a twice-subtracted 
operator 

Qtwo-sub = Q- T0 2 - T'0' 2 , (3.24) 

for which all matrix elements are finite, using two independent subtraction conditions. 

The terms that depend only on m v ves or m° es in the K + — > n + amplitude are 
in principle not a problem, since they can be separated out by varying m v and mf, 
and thus be discarded. But, due to the divergent LECs j3f , 771,1, t? 3i i and r] W , the 
slope of \ Q \K + ) as a function of m v , even after subtracting \ O2 \K + ) with 
T determined from Eq. ()2.4|) . remains power divergent. This magnifies the effect of 
m^ es , which therefore will need to be small enough to keep this systematic error under 
control. Of course, the m^ es dependence also introduces an error in the determination 
of af, but this error does not get magnified by 1/a 2 if m" es = (c/. Eq. (j3.21|) \ The 
conclusion is that the demand for good chiral symmetry in the valence sector is much 
more stringent than in the sea sector. 

As mentioned, the results of Eqs. ()3.22|) and (j3.23|) are not complete without chiral 
logarithms and mass and wave-function renormalizations. But these "decorations" 
do not alter our conclusions on the demand on the smallness of . 



4. Bounds 

In this section we will address the question: In order that the systematic error 
due to neglecting m res effects will be at the 1% level, how small should m v res and m s Tes 
be? Obtaining such bounds requires us to assume something on the magnitude of 
m res -dependent contributions, relative to the "typical" size of NLO effects in ChPT. 
To account for the possible occurrence of numerically large LECs we will include 
a "safety margin" of a factor 10. Namely, we will require that the magnitude of 
the neglected terms is l%o (0.1%) when LECs assume their "typical" anticipated 
magnitude. We will apply similar considerations to other systematic effects such as 
(logarithmic) operator mixings and scaling violations. 

4.1 How small should m^ es be? (Observation 3) 

The method of constructing K — > tttt matrix elements from K — » 7r and K — > 
is highly vulnerable to any non-zero m v res because, as we discussed in Sect. 3.2, for 
m^es 7^ 0, it takes two subtractions to remove all the power divergences. Once- 
subtracted matrix elements will therefore still contain power- divergent terms. 

The magnitude of the un-subtracted K + — > tt + matrix element is parametrically 
of order (l/a 2 )(B m/ f 2 ). Here and below, m should be understood as a shorthand 
for the linear combination of valence-quark masses relevant for the specific matrix 
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element. In any given numerical computation, the magnitude of the subtracted matrix 
element can be written as (A/ a 2 )(B m/ f 2 ) . Quenched simulations [HUH] with a -1 ~ 2 
GeV find that A is a few percent, 14 and we will use A « 1% in our bounds. 

Before we derive our bound, let us comment on the situation if one considers 
only LO in ChPT. The LEC oef is extracted from the slope of the once-subtracted 
K + — > tt + matrix element with respect to m v , while the divergence associated with 
0' 2 (the 62 term in Eq. (|3.23)l ) is given by the intercept. Since the slope is independent 
of the intercept, this procedure effectively amounts to doing both subtractions at the 
same time at LO. See e.g. Fig. 25 of ref. |E]. 15 As we showed in Sect. 3.2, this is an 
artifact of LO ChPT, and not a valid procedure at NLO. The fact that terms at NLO 
compete with LO terms does not indicate a failure of ChPT. What we have here is 
a situation where a divergent NLO contribution competes with a finite LO one. The 
problem would go away had we made two subtractions! In that case, the bound on 
m res w °uld be closer the bound on m s ref , (discussed in Sect 4.2 below). We re-examine 
this issue in Sect. 5. 

At m^ es 7^ 0, once-subtracted matrix elements will have NLO power divergences 
which are parametrically of order (l/a 2 )(Bom/ f 2 )Bom^ es / (4irf) 2 . Our bound follows 
from the requirement that this will not exceed l%o of the physical answer. Dropping 
common factors, we obtain 

B m v ie J(4irf) 2 < 0.001A w 1(T 5 . (4.1) 

It may be more useful to re-express this bound in terms of the quantity am^ es , which 
is the dimensionless residual mass measured directly in the simulation. Both B Q and 
Anf are roughly of order 1 GeV, and the same is true for the inverse lattice spacing 
used in numerical simulations. Hence, in practice, B /(47rf) 2 « a and we may re-write 
Eq. (0~U) as am v ves < 10" 5 . 

Taking Bom/(4:7rf) 2 as the chiral expansion parameter is based on the known 
values of NLO strong LECs. The LECs Aj (cf. Eq. ()3.10|l ) suitable for this parame- 
terization are indeed 0(1). What would happen if it turns out that, say, B m/ (47r/ 2 ) 
(rather than B m/(4:7rf) 2 ) is the "true" chiral expansion parameter for the relevant 
weak matrix elements? Since 4tt 10, would this change our estimate by the same 
amount? The answer is no! It is precisely because of this uncertainty that we have 
already included the safety margin of 10 in our estimate. Thus, requiring that om" es 
be of order 10~ 6 would amount to a safety margin of a factor 100. 

4.2 How small should m^ es be? (Observation 4) 

From now on we will assume that am v res < 10~ 5 , and, hence, that we may use all the 
results derived for m" es = 0. As for m* es , we may or may not neglect it the analysis 
of a simulation, and the resulting error will be slightly different in each case. 

Let us first assume that m* es is neglected, namely, one performs the ChPT match- 
ing at m v res = m^ es = 0. The obvious advantage of this choice is that the analysis 

14 This is consistent with Aq CD w 0.01/a 2 . 

15 From this figure one deduces that, like af , the LO LEC 6*2 (Eq. is parametrically of order 

1/a 2 , as expected. 
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involves the minimal number of NLO parameters as dictated by continuum ChPT. 
The neglected m* es -dependent terms are again NLO, but this time they contain no 
power divergence, because the single subtraction of Sect. 2 removes the entire power 
divergence for m v res = 0. We already know from the LO analysis that the magnitude 
of a "typical" finite LEC is A times the magnitude of a "typical" power-divergent 
LEC. The neglected terms are therefore of order (A / a 2 ) (Bom/ f 2 ) B m* es / (4n f) 2 , and 
this should be less than l%o times the LO finite result. Dropping all common factors 
(including A this time) gives rise to the simple result 

B m s r J(^f) 2 < 0.001, (4.2) 

As before, this bound is the same as aml oa < 0.001 if a^ 1 ~ 1 GeV. 

Compared to Eq. (j4.1|) . the bound ()4.2|) is far less stringent. We now consider also 
short-distance systematic errors induced by neglecting m* es . This includes mixings 
between dimension-six operators that are allowed by the symmetry of Eq. (j2.1|) but 
not by the full PQ symmetry (J3.1)) . Operator mixings are determined (e.g. using the 
RI NPR method [6 ) at off-shell momenta of roughly p « 1/a. The resulting "wrong- 
chirality" operator mixing will therefore be at most of order am^ es , and requiring this 
to be a l%o effect (safety margin included) we again end up with am^ es < 0.001. 16 

We next consider what scaling violations are induced at non-zero m^ es . At m" es = 
m res = 0' domain-wall fermions have only 0(a 2 ) scaling violations. Since clKqcd ~ 
0.1, this is a 1% effect. For m s res ^ we may have 0(a) scaling violations coming from 
the dimension-five operator qo^F^q. Exact chiral symmetry forbids this operator, 
hence it can at most arise with a coefficient of order am^. es . Requiring that this effect 
will be smaller than 0.1(aAgc£)) 2 we end up with the same bound as before. 

In summary, taking known sources of systematic error into account we conclude 
that, provided om^ < 0.001, all m^ es effects can be completely neglected and the 
resulting error due to m* es should be at the 1% level. 

Of course, several independent 1% errors might accumulate to a somewhat bigger 
error. If one does ChPT at non-zero m* es , the systematic error generated by the 
ChPT matching will be entirely due to NNLO effects. However, this would require 
more measurements to extract the new linear combinations of NLO LECs that occur 
in K° — > 7r + 7r~ when m* es 7^ 0. 

We anticipate that the numerical calculation of dimension-six operator mixing 
should be feasible assuming only the symmetry of Eq. ()2.1|) which takes into account 
that m* es 7^ 0. (See e.g. the NPR mixing matrix in Tables 12,13 of ref. [6], where 
mixings that violate SU(3)l x SU(3)r are not neglected.) 

Quark masses commonly used in numerical simulations are typically in the range 
of few times 10 MeV, and so am s res « 0.001 may already account for up to 10% of 
the physical sea-quark mass. We thus believe that, no matter what, one should avoid 
residual masses (be it m s res or, certainly, m^ es ) larger than 0.001. 

The good chiral symmetry of domain-wall fermions enhances a quenched pathol- 
ogy: due to the absence of the fermion determinant, contributions of physical zero 
modes grow like the inverse quark mass. While methods have been devised to deal 

16 This bound is obtained directly in terms of am^ es without using the "translation" Bq/(AhJ) 2 k a. 
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with this problem in the quenched theory JHJj clearly the only true solution is to 
un-quench. The pathology is removed when the physical sea and valence masses are 
set to the same value. To LO, this means 

™t = m% + m v res = m\ + m s res . (4.3) 

Observe that this does not require that m" es and m* es be equal. When m" es is negligibly 
small, this amounts to tuning the valence-quark mass ra? to m* + m^ es . In a numerical 
simulation, the correct procedure is to tune the valence-pion mass to the sea-pion 
mass. Another advantage of this choice is that enhancement of finite-volume effects 
is avoided [T4~] . 

5. Discussion 

Our analysis shows that, because power divergences are encountered in the valence 
sector only, the chiral-symmetry requirements on valence quarks are much more strin- 
gent in comparison to sea quarks. To keep systematic errors due to chiral symmetry 
violation at the 1% level, we estimate that one needs am v res < 10 -5 and am s rcs < 10~ 3 
at a^ 1 ~ 2 GeV. The obvious conclusion is to adopt the strategy of using different 
DWF actions for the two sectors. A more economic DWF action can, and should, 
be chosen for the sea sector; it is the sea sector that costs most of the computational 
effort, and at the same time this is where a much larger residual mass can be tolerated! 

The simplest way to choose different DWF actions for the two sectors is to take 
L\ < L\. This choice is a hybrid scheme in the sense of ref. jlj, with symmetry group 
G sea x G va i ence = U(N S ) x U(N V \N V ). For L5 = L5 the symmetry group enlarges 
to the group U(N V + N S \N V ) [TUj . and in that case the theory is what is usually 
referred to as partially quenched, rather than hybrid. While these are all rather 
trivial statements, they have one relevant consequence: The most general theory, 
with free parameters m q ,s and Ll' s , rigorously contains fully unquenched QCD as a 
special case, by choosing m q = m q , — L|. However, while simulations can easily 
obtain data points for m q = m s q , this is less likely the case for the size of the fifth 
dimension. The problem is that, as we have seen, choosing L\ = L s b for affordable 
values of L s b is unlikely to yield the high chiral accuracy needed in the valence sector. 
Note that one might choose hybrid schemes in which the valence- and the sea-quark 
DWF operators differ not only in the values of L 5 or m q , but also in other ways (see 
below) . 

Of course, spectral functions extracted from the mixed theory, in which quark 
masses have been tuned such that Eq. ([4.3]) holds, may suffer from violations of 
positivity at non-zero lattice spacing. However, provided the bounds discussed in 
Sect. 4 are satisfied, implying that a(m^. es — m^ es ) errors are not bigger than a 2 errors, 
this will show up as just another source of 0(a 2 ) scaling violations. If Eq. (|4.3|) does 
not hold, one expects unitarity violations to occur also in the continuum limit as in 
any partially-quenched theory. 

In the quenched case, changing the gauge action (to Iwasaki [TJ] or to DBW2 [20] ) 
turned out to be enough to push m ves to negligibly small values. In the dynamical 
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DWF case [21], fiddling only with the gauge action gives rise to m res = m s ves values 
which at present are marginally within our bound for two flavors, and larger than our 
bound for three flavors. Moreover, trying to use exactly the same DWF action for the 
valence sector would entail violating our m v ves bound by several orders of magnitude. 

The inherent reason for the larger m res values obtained in the dynamical case is 
the screening effect of the sea quarks, requiring a larger bare coupling in order to 
maintain the same lattice cutoff. This larger bare coupling pushes one closer to the 
Aoki phase [221 E31 121] ■ Inside the Aoki phase, DWF and overlap fermions j2E| cannot 
be used because it is impossible to maintain both chirality and locality I17j. 

To meet the bounds derived in this paper it is necessary to use better DWF 
actions (for both fermions and Pauli-Villars fields). We recall that the basic elements 
that go into a DWF construction are the four-dimensional "kernel" (for standard 
DWF, a super-critical Wilson operator) and the way the fermions are allowed to hop 
in the fifth dimension (for standard DWF, a free one-dimensional Wilson operator). 
Both play a role in the resulting Ginsparg- Wilson (GW) operator |2Zj obtained in the 
chiral limit L5 — > 00, and faster approach to the DWF chiral limit often gives rise to 
a GW operator with better locality properties. While the study of improved DWF 
actions is outside the scope of this paper, we briefly list several existing proposals. 

1. Accelerated convergence to the chiral limit [2E]- This method changes the couplings 
in the fifth dimension, in effect applying a Mobius transformation to the Wilson kernel. 
It includes as special cases the standard DWF formulation j2] and the "truncated 
overlap" of ref. [2H]. It shows promising results. It can be applied to both the 
sea- and the valence-quark sectors. The difference between the sea and the valence 
DWF actions can come not only from (m and) L 5 but also from two optimization 
parameters, see ref. [22j for details. 

2. Modified four-dimensional kernel. Modifications have been proposed based on the 
rate of approach to the chiral limit in perturbation theory j3U], and on approximate 
solutions [3T| of the GW relation which also improve the locality of the resulting 
overlap operator. These methods are again applicable to both sea and valence quarks. 

3. Modified Pauli-Villars action [221. This method is relevant only to the sea-quark 
sector. It aims to suppress the Boltzmann weight of dislocations that give rise to low- 
lying eigenmodes of the Wilson kernel. These eigenmodes allows for un-suppressed 
propagation in the fifth dimension, and are the dominant non-perturbative source of 
chiral symmetry violation. 

4. "Projection method" [S3]- I n this method, one adds to the DWF Dirac operator 
terms proportional to the projectors on low-lying eigenmodes of the Wilson kernel. 
This in effect raises the corresponding eigenvalues and eliminates the associated un- 
suppressed propagation in the fifth dimension. The method is consistent so long 
as these eigenmodes are exponentially localized. The method could be prohibitively 
expensive for the sea sector. As for the valence sector, the extra cost of computing the 
projection terms should be small compared to the cost of generating the dynamical 
configuration itself. Under favorable conditions, "projecting out" a relatively small 
number of Wilson eigenmodes can lead to a significant reduction of m^ es . 
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5. Another method which would be prohibitively expensive for the sea sector, but 
can be used in the valence sector, is to smear the gauge links before plugging them 
into the DWF Dirac operator j^l] . Again, this removes many of the dislocations that 
would otherwise give rise to low-lying eigenmodes of the Wilson kernel. 

Once dynamical DWF configurations with am s res < 10~ 3 have been produced 
there is a good chance that, with all the extra "tricks" available in the valence sector, 
it will be possible to meet the valence-sector bound am^ es < 1CT 5 as well, at affordable 
cost. 

In Sect. 3 we concluded that, when m v res = 0, the divergent part of T is JF 2 /a 2 = 
8a\j f 2 . Similarly, if we consider the twice-subtracted operator (Eq. (|H.24jl ) needed 
when m" es ^ 0, we have that ^/a 2 = 8a^/ f 2 as before, while T 2 /a 2 = 80 2 / f 2 - This 
follows from the LO analysis, but should remain true to NLO if indeed the NLO 
corrections to T and T' are always finite (c/. Conjecture 2). This suggests that 
a twice-subtracted operator may be defined by first performing one subtraction by 
imposing Eq. (|2.4|) . and then performing the second subtraction, Qtwo-sub = Qsub _ 
T'0' 2 , by imposing (7r + | Q tW o-sub = in the limit that the explicit valence quark 
mass is zero. The ChPT matching should then involve the corresponding NLO result 
T' = 862/ f 2 +0(m ves , m s ), where the NLO terms are again finite. Observe that, unlike 
the first subtraction, the second subtraction cannot be done without relying on ChPT 
to some order. This brings in new uncertainties (for instance, the convergence of the 
chiral extrapolation). Given these uncertainties, we believe that the necessary bound 
on am" rcs will fall somewhere in between 10 -5 and 10~ 3 in that case. 

Finally, it is interesting to see whether the considerations on the interplay of 
power divergences and chiral symmetry presented in this paper carry over to other 
fermion methods. For instance, as an alternative to using DWF, efforts are under way 
to use staggered valence quarks to determine K —>■ ttti from K — > ix matrix elements 
Naively, one expects that bounds on the maximum tolerable Goldstone-boson 
mass splittings coming from taste symmetry breaking would have to be similarly 
stringent as those on m v res . The reason is that taste symmetry breaking in the valence 
sector might again lead to the need to perform more than one power-like subtraction. 
However, in this case the problem is avoided by using only the exact Goldstone bosons 
on the external lines of the matrix elements under consideration. 
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Appendix A. Systematics of chiral symmetry violations 

In this paper, we assumed that finite-Z/5 chiral symmetry violations can be treated 
in ChPT by introducing a spurion field associated with the "residual mass." While 
the bare quark mass is a small parameter explicitly present in the DWF action, the 
same is not true for the residual mass; its smallness is a dynamical result. In this 
Appendix we discuss this dynamics in some detail, finding that a systematic treatment 
of residual chiral symmetry violations requires several spurions. However, this will 
not modify the NLO analysis of Sect. 3 (except for an inessential modification to 
Eq. ()3.23j) ) and, thus, it has no effect on our conclusions. 

At finite L 5 , the DWF's PCAC relation takes the form 

d-J^ = 2mJ« + 2J a 5q , (A.l) 

where for simplicity we assume equal quark masses. Here Jf is a DWF's partially- 
conserved non-singlet axial current and d~ is the backward lattice derivative. J5 is 
built out of fermion field residing on the boundaries, and interpolates a (continuum) 
pseudo-scalar density. Jg q is a lattice artifact that involves fermion fields residing 
inside the five-dimensional "bulk," half-way between the boundaries. The notion of 
a residual mass comes from the fact that the leading long-distance effect of J§ q can 
be expressed as J5 ~ m res J5 . This motivated the definition of the spurion M res in 
Eq. (j3.14J) . In this appendix we address the legitimacy of this approximation. 

Propagation in the fifth dimension is governed by a transfer matrix T, from which 
a hamiltonian H = —\og(T)/a^ may be defined, where 05 is the spacing in the fifth 
dimension. H is closely related to the Wilson operator, and in particular the two 
operators share identical zero modes. Each eigenmode of H can be characterized as 
extended or localized. Outside the Aoki phase, one expects the localized eigenmodes 
to lie below a mobility edge A c > 0, above which the eigenmodes are extended. Fol- 
lowing ref. we will now separately discuss the chiral symmetry violations arising 
from the extended and from the localized modes, in that order. 

In the presence of extended modes only, low-energy quark modes will have a 
factorized wave function iPdwf(x, s) — > qn(x)x(s) + qi(x)x(L 5 + 1 — s) where s = 
1,2, . . . ,L 5 is the fifth coordinate and qR,i(x) = |(1 ± Js)q( x ) is an effective four- 
dimensional quark field. This was derived in perturbation theory [30j, and is ex- 
pected to hold non-perturbatively as well. The fifth-coordinate wave function xi s ) 
is dominated by the extended modes near the mobility edge, 17 and is given by 
x(s) ~ exp(— a 5 sA c ). Since the left-handed and right-handed quarks reside on op- 
posite boundaries, each chirality flip will involve a factor of x(-^s) ~ exp(— a 5 L 5 A c ) 
when m = 0. This resembles a mass insertion with x(-^s) as the mass. Hence we can 

17 In perturbation theory the mobility edge is equal to the gap of the free H. See also the last 
paper of ref. [T7) . 
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account for it by introducing a mass-spurion M ext with expectation value 

M ext = diag(r< xt , . . . , m s ext , . . . , m v ext , . . .) , (A.2) 

where 

am^ ~ exp(-a 5 2#£) , (A.3) 

and similarly for m| xt . 

We next turn to the localized modes. Among these, the most dangerous are 
the zero modes of H, which give rise to un-suppressed propagation between the two 
boundaries. At a given L 5 , the term "zero modes" stands here for all modes with 
eigenvalue |A| < 1/L 5 . The probability (per unit volume) to encounter a zero mode is 
roughly given by p(0)/L 5 , where /5(A) is the spectral density of H, and l/L 5 accounts 
for the energy interval. 18 Evidently, the smallness of this effect hinges on having a 
small p(0). As before, we introduce a new mass-spurion M\ c \ with 

M M = diag(m^ cl , . . . , mf ch . . . , m v lch . . .) , (A.4) 

in which 

amr cl ~a 4 /f(0)/^, (A.5) 

etc. However, for the chiral symmetry violations coming from localized modes, this 
is not the whole story. Once a zero mode is found some place in the configuration, it 
allows for quarks of any flavor to propagate from one boundary to the other with no 
"penalty." Since the process is confined to the space-time region occupied by the zero 
mode, we anticipate that it can be described in terms of a local effective multi-fermion 
interaction. Given e.g. N s sea quarks, a single zero mode can certainly give rise to 
effective vertices containing 2,4,. . ., up to 2N S fermions, which induce one chirality 
flip per flavor. 19 

The Symanzik action is a continuum effective action in which the lattice artifacts 
occur in an expansion in the lattice spacing. In this effective action the multi-fermion 
vertices discussed above will have dimensionful coupling constants, each of which will 
be roughly equal to the common factor p(0)/L 5 , times an appropriate power of the 
lattice spacing. To account for this effect in ChPT [2H1 H] , we thus need to introduce 
a third spurion W transforming again in the same way as a mass spurion, and having 
an expectation value 

W = diag(w;, . . . , w, . . . , w, . . .) , (A. 6) 

with 

w ~ a 3 . (A. 7) 

Thus, e.g. a sea-sector 2n-fermion vertex (which induces n chirality flips) will occur 
at order mf cl a 3 *-™ -1 -* in the chiral expansion. 

We end up with the following two rules for residual chiral symmetry violations in 
ChPT: 

18 This corresponds to the third scenario described in appendix C.2 of ref. |3U| . 

19 This might be thought of as reminiscent of the 't Hooft vertices induced by instanton zero modes. 
We leave open how many chirality flips can be induced by a single zero mode of H in the case at 
hand. 
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1. The effects are accounted for by the three mass spurions M ext , M\ c \ and W . 

2. The spurion W cannot occur except in terms that already contain at least one 
power of M\ c \. (More precisely, this is true separately for the valence and sea 
sectors.) 

The values of all three spurions are to be defined in the limit in which the explicit 
quark mass has been set equal to zero. 20 ChPT will then automatically take care of 
the dependence of the residual mass on the explicit quark masses. Numerically, for 
the range of quark masses used in simulations, the variation in the residual mass is 
found to be at the level of a few percent [HH2HI, showing that it is indeed consistent 
to treat it higher order effect in ChPT. 

For small L 5 the dominant chiral symmetry violating effects will come from ex- 
tended modes. Since this effect is damped exponentially, above some value of L 5 
the localized zero modes will take over as the dominant source of chiral symmetry 
violations. Their effects have a lot in common with the chiral symmetry violations 
of Wilson fermions. The crucial difference is that, for DWF, these effects have a low 
probability given by p(0)/L 5 . This is the origin of Rule 2. 

It has been suggested that the treatment of DWF chiral symmetry violations 
in ChPT can be based on promoting the fifth-direction links located exactly in the 
middle of the fifth dimension to a spurion Q ^H] • However, adopting the more elabo- 
rate scheme of spurions developed above allows one to account more faithfully for the 
magnitude of residual chiral symmetry violations coming from the different dynamical 
sources. 

Were it not for W, we could define M res = M ext + M\ c \ and the ChPT treatment 
would be reduced to the single M res spurion introduced in Eq. (j3.14j) . While in general 
this is not the case, thanks to Rule 2 it remains true that the leading residual-mass 
dependence must be through the sum M ext + Mid. Let us now re-examine the results 
of Sect. 3. First, for m^ es = (i.e. for = m\ cl = 0), the results depend linearly 
on m° es . The distinction between the contributions of extended and localized modes 
therefore plays no role to NLO when m v res = 0. To follow the above rules, we would 
only need to perform the trivial substitution m* es = m| xt + mf cl 

When m\ cl is non-zero things get (yet) more complicated compared to Eq. (J3.23J) . 
This equation is quadratic in m^ es . Therefore, apart from setting m^ es = m^ xt + mf cl , 
there are additional NLO terms proportional to mf cl a 3 . This, however, has no effect 
on the bound (j4. ljl . which originates from the terms in Eq. ()3.23j) proportional to 
m^s m v . 

So far, we have encountered no difference between the way the explicit-mass 
spurion M and the new spurion M ext enter ChPT. This raises the question whether 
one can trade M and M ext for a single spurion M' = M + M ext . The answer depends 
on the context. For pure-QCD observables (such as spectrum), the only distinction is 
that the LECs corresponding to M and to M ext will in general exhibit different scaling 
violations. This is best seen by considering the effective four-dimensional lattice Dirac 

°One should be able to extract m ext and m\ c \ from a fit of function oi Lr,. 
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operator obtained by integrating out the five-dimensional bulk modes [31)]. 21 This 
Dirac operator will involve a new mass term with coefficient m ext ~ exp(— a^L 5 X c ). 
However, in addition it will contain irrelevant terms transforming in the same way 
as a mass term, with coupling constants of order exp(— asL 5 A c ). Since there are no 
corresponding irrelevant interactions for the explicit mass term, the resulting scaling 
violations will be different. 

However, when insertions of the effective weak hamiltonian are considered, one 
encounters the power divergences which are the main topic of this paper. In this case, 
the numerical results [HJ E] show that the difference between the LECs corresponding 
to m v and to m^ es , such as for example a\ and 8 2 , is roughly of the same size as these 
LECs. Therefore it is not possible to combine the explicit-mass spurion M with any 
of the residual-mass spurions. 
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